Scaling Interpretability
anthropic.com·2h